NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Statistical Inference for Covariate-Adjusted and Interpretable Generalized Factor Model with Application to Testing Fairness

Ouyang, Jing; Cui, Chengyu; Tan, Kean Ming; Xu, Gongjun (October 2025, Annals of applied statistics)

Free, publicly-accessible full text available October 31, 2026
Bridging Human and LLM Judgments: Understanding and Narrowing the Gap

Polo, Felipe Maia; Wang, Xinhe; Yurochkin, Mikhail; Xu, Gongjun; Banerjee, Moulinath; Sun, Yuekai (October 2025, NeurIPS 2025)

Free, publicly-accessible full text available October 31, 2026
Detecting Differential Item Functioning across Multiple Groups using Group Pairwise Penalty

https://doi.org/10.1017/psy.2025.10034

Lyu, Weicong; Wang, Chun; Xu, Gongjun (August 2025, Psychometrika)

Free, publicly-accessible full text available August 11, 2026
A Novel Method for Detecting Intersectional DIF: Multilevel Random Item Effects Model with Regularized Gaussian Variational Estimation

https://doi.org/10.1017/psy.2025.10046

Ren, He; Lyu, Weicong; Wang, Chun; Xu, Gongjun (September 2025, Psychometrika)

Abstract Differential item functioning (DIF) screening has long been suggested to ensure assessment fairness. Traditional DIF methods typically focus on the main effects of demographic variables on item parameters, overlooking the interactions among multiple identities. Drawing on the intersectionality framework, we define intersectional DIF as deviations in item parameters that arise from the interactions among demographic variables beyond their main effects and propose a novel item response theory (IRT) approach for detecting intersectional DIF. Under our framework, fixed effects are used to account for traditional DIF, while random item effects are introduced to capture intersectional DIF. We further introduce the concept of intersectional impact, which refers to interaction effects on group-level mean ability. Depending on which item parameters are affected and whether intersectional impact is considered, we propose four models, which aim to detect intersectional uniform DIF (UDIF), intersectional UDIF with intersectional impact, intersectional non-uniform DIF (NUDIF), and intersectional NUDIF with intersectional impact, respectively. For efficient model estimation, a regularized Gaussian variational expectation-maximization algorithm is developed. Simulation studies demonstrate that our methods can effectively detect intersectional UDIF, although their detection of intersectional NUDIF is more limited.
more » « less
Free, publicly-accessible full text available September 15, 2026
Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis

https://doi.org/10.1017/psy.2025.9

Cui, Chengyu; Liu, Yanlong; Xu, Gongjun (March 2025, Psychometrika)

Abstract Cognitive diagnosis models (CDMs) have been popularly used in fields such as education, psychology, and social sciences. While parametric likelihood estimation is a prevailing method for fitting CDMs, nonparametric methodologies are attracting increasing attention due to their ease of implementation and robustness, particularly when sample sizes are relatively small. However, existing consistency results of the nonparametric estimation methods often rely on certain restrictive conditions, which may not be easily satisfied in practice. In this article, the consistency theory for the general nonparametric classification method is reestablished under weaker and more practical conditions.
more » « less
Free, publicly-accessible full text available March 17, 2026
Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

https://doi.org/10.1017/psy.2024.15

Lyu, Weicong; Wang, Chun; Xu, Gongjun (March 2025, Psychometrika)

Abstract Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across studies. In this study, we focus on a regularized explanatory multidimensional item response theory model (re-MIRT) for establishing measurement equivalence across instruments and studies, where regularization enables the detection of items that violate measurement invariance, also known as differential item functioning (DIF). Because the MIRT model is computationally demanding, we leverage the recently developed Gaussian Variational Expectation–Maximization (GVEM) algorithm to speed up the computation. In particular, the GVEM algorithm is extended to a more complicated and improved multi-group version with categorical covariates and Lasso penalty for re-MIRT, namely, the importance weighted GVEM with one additional maximization step (IW-GVEMM). This study aims to provide empirical evidence to support feasible uses of IW-GVEMM for re-MIRT DIF detection, providing a useful tool for integrative data analysis. Our results show that IW-GVEMM accurately estimates the model, detects DIF items, and finds a more reasonable number of DIF items in a real world dataset. The proposed method has been integrated intoRpackageVEMIRT(https://map-lab-uw.github.io/VEMIRT).
more » « less
Free, publicly-accessible full text available March 1, 2026
High-dimensional Factor Analysis for Network-linked data

https://doi.org/10.1093/biomet/asaf012

Li, Jinming; Xu, Gongjun; Zhu, Ji (February 2025, Biometrika)

Abstract Factor analysis is a widely used statistical tool in many scientific disciplines, such as psychology, economics, and sociology. As observations linked by networks become increasingly common, incorporating network structures into factor analysis remains an open problem. In this paper, we focus on high-dimensional factor analysis involving network-connected observations, and propose a generalized factor model with latent factors that account for both the network structure and the dependence structure among high-dimensional variables. These latent factors can be shared by the high-dimensional variables and the network, or exclusively applied to either of them. We develop a computationally efficient estimation procedure and establish asymptotic inferential theories. Notably, we show that by borrowing information from the network, the proposed estimator of the factor loading matrix achieves optimal asymptotic variance under much milder identifiability constraints than existing literature. Furthermore, we develop a hypothesis testing procedure to tackle the challenge of discerning the shared and individual latent factors’ structure. The finite sample performance of the proposed method is demonstrated through simulation studies and a real-world dataset involving a statistician co-authorship network.
more » « less
Free, publicly-accessible full text available February 21, 2026
tinyBenchmarks: evaluating LLMs with fewer examples

Maia_Polo, Felipe; Weber, Lucas; Choshen, Leshem; Sun, Yuekai; Xu, Gongjun; Yurochkin, Mikhail (July 2024, Proceedings of Machine Learning Research)

Full Text Available
tinyBenchmarks: evaluating LLMs with fewer examples

Maia_Polo, Felipe; Weber, Lucas; Choshen, Leshem; Sun, Yuekai; Xu, Gongjun; Yurochkin, Mikhail (July 2024, Proceedings of Machine Learning Research)

Full Text Available
Sufficient and Necessary Conditions for the Identifiability of DINA Models with Polytomous Responses

https://doi.org/10.1007/s11336-024-09961-w

Lin, Mengqi; Xu, Gongjun (March 2024, Psychometrika)

Full Text Available

« Prev Next »

Search for: All records